Speaker Independent Continuous Speech to Text Converter for Mobile Application
Authors
Abstract
An efficient speech-to-text converter for mobile applications is presented in this work. The prime motive is to formulate a system that gives optimum performance in terms of complexity, accuracy, delay, and memory requirements in a mobile environment. The speech-to-text converter consists of two stages, namely front-end analysis and pattern recognition. The front-end analysis involves preprocessing and feature extraction. Traditional voice activity detection (VAD) algorithms that track only energy cannot reliably identify potential speech in the input, because the unwanted part of the signal also carries some energy and appears to be speech. The proposed system uses a VAD that separately measures the energy of the high-frequency part, as the zero-crossing rate, to differentiate noise from speech. Mel Frequency Cepstral Coefficients (MFCC) are used for feature extraction, and a Generalized Regression Neural Network is used as the recognizer. MFCC provides a low word error rate and better feature extraction, and the neural network improves the accuracy. Thus a small database containing all possible syllable pronunciations of the user is sufficient to give recognition accuracy close to 100%. The proposed technique therefore supports the realization of real-time speaker-independent applications on devices such as mobile phones and PDAs.
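The idea of combining frame energy with the zero-crossing rate can be illustrated with a minimal sketch. It is not the authors' algorithm: the frame length, the thresholds, and the decision rule (keep a frame only when its energy is high and its zero-crossing rate is low) are assumptions chosen for illustration, and the function names are hypothetical.

```python
import math


def frame_signal(samples, frame_len, hop):
    """Split a list of samples into frames of frame_len, advancing by hop."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_speech(samples, frame_len=200, hop=200,
                  energy_thresh=0.01, zcr_thresh=0.25):
    """Return one boolean per frame (True = kept as speech).

    Assumed rule, for illustration only: a frame counts as speech when
    its energy exceeds energy_thresh AND its zero-crossing rate stays
    below zcr_thresh, so energetic high-frequency noise is rejected.
    """
    return [short_time_energy(f) > energy_thresh
            and zero_crossing_rate(f) < zcr_thresh
            for f in frame_signal(samples, frame_len, hop)]


# Synthetic input at an assumed 8 kHz sampling rate:
# silence, then a 100 Hz tone (speech-like), then a sign-alternating
# sequence that has high energy but a zero-crossing rate of 1.0.
silence = [0.0] * 400
tone = [0.5 * math.sin(2 * math.pi * 100 * n / 8000) for n in range(400)]
noise = [0.5 if n % 2 == 0 else -0.5 for n in range(400)]

decisions = detect_speech(silence + tone + noise)
```

An energy-only detector would accept the last two frames of this input, since their energy is higher than that of the tone; the zero-crossing check is what rejects them. Real thresholds would have to be tuned on recorded data.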
Similar resources
TRANSLINGUATOR: Web based application for Speech Translation of Human Voices based on Voice Forensics without Changing the Source Voice
Translinguator is a web-based application that can be deployed with the help of cloud computing technology. The application can be integrated with devices such as mobile phones and tablets, or it can be built with dedicated hardware as an independent device. It mainly involves the integration of various existing concepts in a specific sequence to obtain the unique desired output. This appli...
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
Distributed Speaker Recognition Using the ETSI Aurora Standard
The ETSI “Aurora” is a standard for distributed speech recognition over the mobile cellular network. We have investigated the use of the features defined in this standard for speaker recognition, in a text-independent system based on Gaussian Mixture Models (GMM). The application context is distributed speaker recognition for user authentication on the mobile cellular network. We have found tha...
Text-independent Speaker Recognition by Trajectory Space Comparison
We present the principle of trajectory space comparison for text-independent speaker recognition and some solutions to the space comparison problem based on vector quantization. A comparison of the recognition rates of different solutions is reported. The experimental system achieved a 99.5% text-independent speaker recognition rate for 23 speakers, using 5 phrases for training and 5 for testing. A speaker-...
Journal title:
- CoRR
Volume abs/1307.5736
Publication date: 2013